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Abstract 

The matrices of spanning rooted forests are studied as a tool for analysing the struc- 
ture of digraphs and measuring their characteristics. The problems of revealing 
the basis bicomponents, measuring vertex proximity, and ranking from preference 
relations / sports competitions are considered. It is shown that the vertex acces- 
sibility measure based on spanning forests has a number of desirable properties. 
An interpretation for the normalized matrix of out-forests in terms of information 
dissemination is given. 
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1 Introduction 

The matrices of routes between vertices are useful to analyse the structure 
of graphs and digraphs. These matrices are the powers of the adjacency ma- 
trix. In this paper, we consider the matrices of spanning rooted forests as an 
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alternative tool for analyzing digraphs. We show how they can be used for 
finding the basis (source) bicomponents of a digraph (Section 5), for measuring 
vertex proximity (Section 6), and for ranking on the base of preference rela- 
tions / sports competitions (Section 7). In the initial sections, we introduce 
the necessary notation (Section 2), present recurrent formulae for calculating 
the "forest matrices" (Section 3), and list some properties of spanning rooted 
forests and forest matrices (Section 4). 

Three features that distinguish the matrices of forests from the matrices of 
routes are notable. First, all column sums (or row sums) of the forest matrices 
are the same, therefore, these matrices can be considered as matrices of relative 
accessibility. Second, there are matrices of "out-forests" and matrices of "in- 
forests" , enabling one to distinguish "out-accessibility" from "in-accessibility" , 
which is intuitively defensible. Third, the total weights of maximum spanning 
forests are closely related to the Cesaro limiting probabilities of Markov chains 
determined by the digraph under consideration. 

2 Notation and simple facts 

2.1 Weighted digraphs, components, and bases 

Suppose that T is a weighted digraph without loops, V(r) = {1, . . . , n}, n > 1, 
is its set of vertices and E(T) the set of arcs. Let W = (w i3 ) be the matrix of 
arc weights. Its entry is zero if there is no arc from vertex i to vertex j in T; 
otherwise is strictly positive. In what follows, V is fixed, unless otherwise 
specified. If V is a subgraph of T, then the weight of T', w(T'), is the product 
of the weights of all its arcs; if E(T') = 0, then w(T') = 1 by definition. The 
weight of a nonempty set of digraphs Q is 



A spanning subgraph of V is a subgraph with vertex set V(T). The indegree 
id(v) and outdegree od(v ) of a vertex v are the number of arcs that come in v 
and out ofv, respectively. A vertex v is called a source if id(w) = 0. A vertex v 
is isolated if id(v) = od(v) = 0. A route (semiroute) is an alternating sequence 
of vertices and arcs v Q , e l5 v 1 , . . . , e k , v k with every arc e i being {y (resp., 
either (f j_ l5 v { ) or (v i: A P a ^ is a route with distinct vertices. A circuit 

is a route with v = v k , the other vertices being distinct and different from v Q . 
Vertex v is reachable from vertex z in T if v = z or T contains a path from z 
to v. 



(1) 





A digraph is strongly connected (or strong) if all of its vertices are mutually 
reachable and weakly connected if any two different vertices are connected by 
a semiroute. Any maximal strongly connected (weakly connected) subgraph 
of T is a strong component, or a bicomponent (resp., a weak component) of V. 
Let r 1; . . . , T r be all the strong components of V. The condensation (or factor- 
graph, or leaf composition, or Hertz graph) T° of digraph V is the digraph with 
vertex set {r 1; . . . , r r }, where arc (r i; Tj) belongs to E(T°) iff E(T) contains 
at least one arc from a vertex of Tj to a vertex of Tj . The condensation of any 
digraph T obviously contains no circuits. 

A vertex basis of a digraph T is any minimal (by inclusion) collection of 
vertices such that every vertex of T is reachable from at least one vertex of 
the collection. If a digraph does not contain circuits, then its vertex basis is 
obviously unique and coincides with the set of all sources [24,41]. That is why 
the bicomponents of T that correspond to the sources of T° are called the basis 
bicomponents [41] or source bicomponents of T. In this paper, the term source 
knot of T will stand for the set of vertices of any source bicomponent of T. 
In [23], source knots are called W-bases. 

The following statement [24,41] characterizes all the vertex bases of a di- 
graph. 

Proposition 2.1 A set U C V{T) is a vertex basis of V if and only if U 
contains exactly one vertex from each source knot of V and no other vertices. 

Schwartz [36] referred to the source knots of a digraph as minimum 
P -undominated sets. According to his Generalized Optimal Choice Axiom 
(GOCHA), if a digraph represents a preference relation on a set of alterna- 
tives, then the choice should be the union of its minimum P-undominated 
sets. 3 This choice is interpreted as the set of "best" alternatives. A review of 
choice rules of this kind can be found in [40]; for "fuzzy" extensions, see [35]. 

2.2 Matrices of forests 

A diverging tree is a weakly connected digraph in which one vertex (called the 
root) has indegree zero and the remaining vertices have indegree one. A di- 
verging tree is said to diverge from its root. Spanning diverging trees are 
sometimes called out-arborescences. A diverging forest (or diverging branch- 
ing) is a digraph all of whose weak components are diverging trees. The roots 
of these trees are called the roots of the diverging forest. A converging tree 
(converging forest) is a digraph that can be obtained from a diverging tree 

3 This union is also called the top cycle and the strong basis of T. 



(resp., diverging forest) by the reversal of all arcs. The roots of a converging 
forest are its vertices that have outdegree zero. In what follows, spanning di- 
verging forests in T will be called out-forests of T; spanning converging forests 
in T will be called in- forests of V . 

Definition 2.2 An out-forest F of a digraph V is called a maximum out-forest 
of T if T has no out-forest with a greater number of arcs than in F. 

It is easily seen that every maximum out-forest of V has the minimum 
possible number of diverging trees; this number will be called the out-forest 
dimension of V and denoted by d! . It can be easily shown that the number 
of arcs in any maximum out-forest is n — d'; in general, the number of weak 
components in a forest with k arcs is n — k. 

By jr^fT) = jr*^ and F£*{V) = we denote the set of all out- 

forests of T and the set of all out-forests of T with k arcs, respectively; T k ~^ 3 
will designate the set of all out-forests with k arcs where j belongs to a tree 
diverging from i\ J^*~^i = |J^~q T % ^ 3 is the set of such out-forests with all 
possible numbers of arcs. The notation like will be used for sets of out- 
forests that consist of k trees, so = JF*^, k = 1, . . . ,n. Thus, the *^ 
sign relates to out-forests; the corresponding notation with — >* , such as J 7 ""*, 
relates to in-forests, i.e., * images the root(s). 

Let 



n-d' 

°k 



(2) a k = w{m, fc = 0,l,...; a = w{j^) = J] 

fc=0 

By (2) and (1), a k = whenever k > n — d'; o~ — 1. 
We also introduce the parametric value 

n—d' n—d' 

(3) ^) = $>CFrk fc = E^ rfc > r>0 > 

k=0 k=0 

which is the total weight of out-forests in T provided that all arc weights are 
multiplied by r. 

Consider the matrices Q k = (qfj), k = 0, 1, . . . , of out-forests of V with k 
arcs: the entries of Q k are 

(4) 4 = w(?r r )- 

By (4) and (1), Q k = whenever k > n — d'; Q = I. 
The matrix of all out-forests is 

n-d' 

(5) Q = ( qij ) = Y,Qk with entries Qij = w(^). 

k=0 



We will also consider the normalized matrices of out-forests: 

(6) J k = a k 1 Q k , k = 0,...,n-d' ] J = (J y ) = a~ x Q 
and the parametric matrices 

n-d' 

(7) Q(r) = Y,Q^ k and J{t) = c-\t)Q{t), t>0, 

k=0 

where a k , a, and cr(r) are defined by (2) and (3). 

The normalized matrix of maximum out-forests J n _ d i will also be denoted 
by J=(J ij ): 

(8) J = J n -d> ■ 

3 Counting forests by means of linear algebra 

An algorithmic description of maximum out-forests was given in [2]. Another 
algorithm for the enumeration of out-forests can be obtained by adding a ficti- 
tious vertex along with the arcs (0, v) for all vertices v of V and enumerating 
the diverging trees in the supplemented digraph. 

The column Laplacian matrix of T is the n x n matrix L = L(T) = (^ ) 
with entries = —w^ whenever j ^ i and £ u — — ^2 ^ui hJ = 1j • • • > n - ^his 

matrix has zero column sums; in [10] we denoted it L'\ now, for simplicity, 
designation L is used. A row Laplacian matrix differs from the column Lapla- 
cian matrix by the diagonal only: its diagonal is such that the row sums are 
zero. The row and column Laplacian matrices are singular M-matrices (see, 
e.g., [10, p. 258]). Their index is 1 [10, Proposition 12]. The spectra of the 
row Laplacian matrices were studied in [1]. 

The following theorem provides a method to calculate the forest matrices 
by means of linear algebra. 

tr(LQ ) 

Theorem 3.1 [10]. Q k+1 = * k+1 I - LQ k ; a k+1 = -j^p fc = 0, 1, 

Hence, 

(9) Qk + i = ^fftl-LQ k , k = 0,l,.... 

dcf 

Consider the matrices L k = a k I — Q k , k = 0, 1, . . . . Obviously, L k is the 
Laplacian matrix L(T k ) of the digraphs T k of out-forests of T: the arc weights 
in T k are the off-diagonal entries of Q k . Then, by Theorem 3.1, we have 



Corollary 3.2 L k+1 = LQ k and tr(L k ) = ka k , k = 0, 1, 



This implies the following recurrent formula for L k+1 : 
Corollary 3.3 L k+1 = L (^j^I - L k ) , k = 1,2, . . . . 

4 Properties of the forest matrices 

A number of results on the forest matrices are presented in [10]. Some of them 
are collected in the following theorem. 

Theorem 4.1 [10]. 1. Matrices J k , k = 0, . . . , n — d! , J, and J(r) are column 
stochastic. 

2. For any r > 0, Q(t) = adj(J + tL) and cr(r) = det(J + tL) hold, whence, 
J(r) = (I + rLy\ 

3. LJ = JL = 0. 

4. J is idempotent: J 2 = J. 

5. J = lirn^oo J(t) = lim 7 -^ 00 (7 + r L)^ 1 . 

6. rank(J) = d'; rank(L) —n — d'. 

7- Q k = Zto°U-L)\ k = 0,1, 

8. J is the eigenprojection of L. 

Item 2 of Theorem 4.1 is a parametric version of the matrix-forest theo- 
rem [15]. 

To formulate the topological properties of the matrix J, the following no- 
tation is needed. 

Let K = \J i K i , where K i are all the source knots of Y; let Kf be the set 
of all vertices reachable from K i and unreachable from the other source knots. 
For any k G K, K(k) will designate the source knot that contains k. For any 
source knot K of T, denote by the restriction of T to K and by T_k the 
subgraph with vertex set V^(r) and arc set -E'(r) \ E(Tk). For a fixed K, T 
will designate the set of all spanning diverging trees of Y K , and V the set of 
all maximum out-forests of T_k- By T k , k e K, we denote the subset of T 
consisting of all trees that diverge from k, and by r p K *- J >i ^ j £ V(T), the set 
of all maximum out-forests of T_k such that j is reachable from some vertex 
that belongs to K in these forests. J k » is the kth row of J. 

Theorem 4.2 [2]. Let K be a source knot in T. Then the following statements 
hold. 

1. Jij -v^ {i G K and j is reachable from i inT). 



2. Let keK. For any j e V(T), J kj = w(T k )w(V K *~ >j )/w(j : fy,). Furthermore, 
if j E K+, then J kj = J kk = w{T k )/w{T). 



3. ^2 Jkk — 1- In particular, if k is a source, then J kk = 1. 

kdK 

4. For any k 1 ,k 2 e K, J k2 , = (w(T k2 )/w(T kl ))J kl . holds, i.e., the rows k x 
and k 2 of J are proportional. 

We say that a weighted digraph T and a finite homogeneous Markov chain 
with transition probability matrix P inversely correspond to each other if 



where a is any nonzero real number. 

If a Markov chain inversely corresponds to T, then the probability of tran- 
sition from j to % 7^ j is proportional to the weight of arc in T and is if 
E(T) does not contain (i, j). We consider such an inverse correspondence in 
order to model preference digraphs in Section 7: in this case, the transitions 
in the Markov chain are performed from "worse" objects to "better" ones, so 
the Markov chain stochastically "searches the leaders." 

Theorem 4.3 For any finite Markov chain, its matrix of Cesdro limiting 
probabilities coincides with the matrix J of any digraph inversely corresponding 
to this Markov chain. 

Theorem 4.3 follows from the Markov chain tree theorem [26,27], which, 
in turn, can be immediately proved using item 8 of Theorem 4.1 and a result 
of [34] (see [10]). Another proof of Theorem 4.3 can be found in [8]. A review 
on forest representations of Markov chain probabilities is given in [9] . For an 
interpretation of J(r) in terms of Markov chains we refer to [3]. 

5 Detecting the source knots of a digraph 

In this section, we show that entry ij of (7 + rL) -1 , where r > 0, is nonzero if 
and only if j is reachable from i in T and that J points out the source knots 
of r and the vertices reachable from each of them. 

The reachability matrix of a digraph is the matrix R = {r i ,) nxn with entries 



(10) 



I-P = aL T , 




0, otherwise. 



1, if j is reachable from i, 



It follows from the definition (7) of J(r) that for every r > 



(11) 



R = sga(J(r)), 



where the signum function operates entrywise. 

Recall that, by item 2 of Theorem 4.1, J(r) can be calculated as follows: 
J(t) — (I + tL)~ 1 . So we obtain 

Proposition 5.1 For every r > 0, R = sgn ((/ + rL) _1 ). 

An algebraic way to recover the bicomponents of a digraph is to calculate 
the mutual reachability matrix, which is the Hadamard (entrywise) product of 
the reachability matrix and its transpose. Note that the standard algebraic 
means of finding the reachability matrix of a digraph is to compute (/ + 
v4) n_1 , where A is the adjacency matrix, or to successively calculate the power 
matrices (J + A) k until the positions of nonzero entries stabilize; then, in 
both cases, the nonzero entries of the resulting matrix should be replaced by 
ones [41]. The matrices (J — a A)~ l with sufficiently small a > can also be 
used for this end. 

The following result does not provide an effective way to find the reach- 
ability matrix, but it contributes to the understanding of the nature of the 
forest matrices. 

By (7), J(r) = a'^r) £ Q, k) r n ~\ where Q, k) d ^ Q n _ k . It turns out 

k=d' 

that all information about the digraph reachability is accumulated in Q^ d ,^ 
and Q( d , +1 y This follows from 

Proposition 5.2 For any i,j e V(r) and any path from i to j in T, there 
exists an out-forest in FT^ 1 U jF^r^ that contains this path. 

Proof. For the given path and any maximum out-forest in T, consider their 
join and remove all arcs of the out-forest that come in the vertices of the 
path but do not belong to the path. The resulting subgraph contains neither 
circuits nor vertices v with id(v) > 1, i.e., it is an out-forest. It is rooted at 
i and contains at least n — d! — 1 arcs, including the arcs of the given path. 
Hence, it belongs to f 1 ^ 3 U ■ □ 

This implies 

Corollary 5.3 R = sgn(j (d;) + J (d , +1) J , where J {k) = J n „ fc . 

In some cases, the main goal is to find the source knots and the vertices 
reachable from each of them. For example, as was noted in Section 2.1, this is 
the case in choice theory if the Generalized Optimal Choice Axiom (GOCHA) 
is adopted. Then the union K = |jf =1 Ki of the source knots is the set of 
"best" alternatives chosen on the base of a preference relation or a digraph of 
preferences [36] (cf. [40,25,35]). 



It turns out that J immediately reveals K. 

Definition 5.4 The top reachability matrix of V is the matrix R = (r^) nxn 
with entries 

, s ^ J 1, if i belongs to a source knot of V and j is reachable from i, 

y [0, otherwise. 

It follows from item 1 of Theorem 4.2 that 

Proposition 5.5 i? = sgn(J). 

The source knots of T can be disclosed by computing the mutual top reach- 
ability matrix, which is the Hadamard product of R and R T . It follows from 
Proposition 5.5 that 

Proposition 5.6 Vertices i and j belong to the same source knot iff J ^ Jj^O. 

If R is found by of approximate calculation based on item 5 of Theorem 4.1, 
the following statement can be of help. Recall that, by item 2 of Theorem 4.1, 
a = det(7 + L). 

Proposition 5.7 Let T be a digraph with all arc weights 1. Then 

(13) otherwise, *,J-1, 

where J^t) is the ij-entry of J{r) = (I + rL)~ x . 

Proposition 5.7 is formulated for unweighted digraphs, since the reachabil- 
ity relation does not depend on the weights of arcs. 

Proof. If d' = n, the claim is obvious. Let d' < n. Then, by (3) and (7), 
J(r) = B(t) + C(r) holds, where B(r) = (%) = a' 1 ^) EJ^f -1 r k Qk and 
C(r) = (cjj) = o- _1 (t) r n " d Q n -d'. By item 5 of Theorem 4.1, we have B(r) — > 
0, J(t) — > J and C(r) — > J as r — ► oo. 

Let r = cr 2 . Since the weights of all arcs are 1, a k > 1, k = 0, . . . , n — d', 
and t > 1 hold. 

Let = 1. Then, by Proposition 5.5, J ^ > 0, hence, q^~ d > 1, where 
= Qfc> fc = 0,...,n-d'. Therefore, J y (r) > Cy(r) > o-V)^' = 

Let now f y = 0. Then J {j = 0, hence, c^(r) = and Jy(r) = ^(r) < 



6 Forest based accessibility measures 

Formally, by an accessibility measure for digraph vertices we mean any func- 
tion that assigns a matrix P = (p iJ -) nXn to every weighted digraph T, where 
n = |^(r)| . Entry p- is interpreted as the accessibility (or connectivity, re- 
latedness, proximity, etc.) of j from i. 

Consider the accessibility measures P° ut = J(t), where J(r) is defined by 
(7), and P; n = (pg) with pg = w(J^^(t))/w(J^*(t)), where J^^* j (r) and 
JF^*(t) are, respectively, the jf*^*- 7 and JF^* for the digraph T(r) obtained 
from T by the multiplication of all arc weights by r. Parameter r specifies the 
relative weight of short and long ties in T. 

Definition 6.1 Accessibility measures P^ and P^> are dual if for every Y 
and every i, j G V(T), p\f{T) = p^(r'), where T' is obtained from T by the 
reversal of all arcs (preserving their weights). 

The following proposition results from the fact that the reversal of all arcs 
in T transforms all out-forests into in-forests and vice versa. 

Proposition 6.2 For every r > 0, the measures P° ut and P™ are dual. 

What is the difference in interpretation between P° ut and P^ n ? A partial 
answer is as follows. P° ut can be interpreted as the relative weight of % — > j 
connections among the out-connections of i, whereas P™ is the relative weight 
of i — > j connections among the in-connections of j. Naturally, these relative 
weights need not coincide. For example, a connection between an average 
man and a celebrity is usually more important for the average man. This 
example demonstrates that self-duality is not an imperative requirement to 
accessibility measures. The properties of several self-dual measures have been 
studied in [17]. 

The following conditions proposed in part in [17] can be considered as 
desirable properties of vertex accessibility measures. 

Nonnegativity. p { . > 0, i,j G ^(r). 

Reachability condition. For any i, j G V(T), (p^ = -v^ j is unreachable 
from i). 

Self-accessibility condition. For any distinct i, j G V^(r), (A) p, u > p^ and 
(B) Pa > Pji hold. 

Triangle inequalities for proximities. For any i,k,t G V^r), (A) p ki —p ti 
< Pkk -Ptk and ( B ) Pik -Pit < Pkk -Pkt hold - 



The triangle inequalities for proximities a counterparts of the ordinary 
triangle inequality which characterizes distances (cf. [16]). 

Let k,i,t G V(r). We say that k mediates between i and t if T contains a 
path from i to t, i 7^ k 7^ t, and every path from i to t includes k. 

Transit property. If k mediates between i and t, then (A) p ik > p it and 

(B) p kt > p it ■ 

Monotonicity. Suppose that the weight w kt of some arc (k, t) is increased 
or a new (k, t) arc is added to T, and Ap^, i,j e V(r), are the resulting 
increments of the accessibilities. Then: 

(1) Aftt > 0; 

(2) If t mediates between k and i, then Ap ki > Ap ti ; if k mediates between i 
and t then Ap it > Ap ik ; 

(3) (A) If t mediates between k and i, then Ap kt > Ap ki ; 
(B) If k mediates between i and t, then Ap kt > Ap it . 

Convexity. (A) If p ki > p ti and i 7^ k, then there exists a k to i path such 
that the difference p k j —p t j strictly decreases as j advances from k to i along 
this path. (B) If p ik > p it and i 7^ k, then there exists an i to k path such 
that the difference —Pj t strictly increases as j advances from % to k along 
this path. 

The results of testing P° ut and P™ are collected in 

Theorem 6.3 The measures P° ut and P* n satisfy all the above conditions not 
partitioned into (A) and (B). Furthermore, P° ut obeys all (A) conditions and 
P" 1 all (B) conditions. 

Proof. Let P = P° ut . Nonnegativity follows from the definition of P° ut and 
the positivity of arc weights. Reachability condition follows from Proposi- 
tion 5.1. Item (A) of the self-accessibility condition is true because, by (7), 
p it = awir^lr)), = aw{F*-> j {T)), and J*"*(t) C J=*^(r) (strictly), 
where a = w~ 1 (j'*~ > (r)) and J^*^ j (r) is the for the digraph T(r) ob- 

tained from T by the multiplication of all arc weights by r; the same with 
jF*^(r). Item (A) of the transit property is proved similarly. 

To prove (A) of convexity, rewrite item 2 of Theorem 4.1 in the form 
I = J(t) (I + tL). Consider entries ki and ti of J(r) (7 + tL). Since i 7^ k by 
assumption, we get p ki = r w^p^ -p kl ) and p ti = r «- ( ,(/^ + 
5 it , where S it — 1 if i — t and 5 it = otherwise. Hence, p ki —p ti +5 it = 

T Y Jj ^i w jii.iP k j-Ptj) - (Pki-Pti))- Since Pki 'Pti > 0, there exists j e V(T) 
such that w ji 7^ (and thus (j, i) E E(T)) and p kj —p tj > p ki —p ti - Applying 



the same argument to j instead of i, and so forth, we finally obtain k as the 
terminal vertex of this path, as desired. 

The triangle inequality follows from (A) of convexity (taking j = k). For 
the proof of monotonicity (items 1, 2, and 3A) we refer to [3, Proposition 11]. 

The corresponding statements for P™ follow similarly or by duality. □ 

It will be shown elsewhere that for a sufficiently small positive r, P° ut 
additionally satisfies (B) conditions, whereas P™ satisfies (A) conditions, and 
they both satisfy the following addition to monotonicity: Suppose that the 
weight w kt of some arc (k, t) increases or a new (k, t) arc is added; then for 
any i,j G V(r), (i ^ j or k ^ t) implies Ap kt > Ap^ . 

Consider now the accessibility measures P out = (p^) = J = lim.,-^^ P° ut 

and P m = lim^oo P" 1 . Having in mind Theorem 4.3, we call the limiting 
out- accessibility of j from i. 

Let us say that a condition is satisfied in the nonstrict form if it is not 
generally satisfied, but it becomes true after the substitution of > for >, < 
for < and "nonstrictly" for "strictly" in the conclusion of this condition. 

Similarly to Proposition 6.2 we have 

Proposition 6.4 The accessibility measures P out and P m are dual. 
The results of testing P out and P in are collected in 

Theorem 6.5 The accessibility measures P out and P m satisfy nonnegativity 
and the part of reachability condition, but they violate the "=>■" part 

of reachability condition. Moreover, P out satisfies, in the nonstrict form, 
items (A) of self- accessibility condition, transit property, monotonicity, and 
convexity, whereas P m satisfies in the nonstrict form items (B) of these con- 
ditions. P out satisfies (A) and P m satisfies (B) of triangle inequality for prox- 
imities. 

By virtue of Theorem 6.5, the limiting accessibility measures only "marginally" 
correspond to the conception of accessibility that underlies the above condi- 
tions. 

Proof. The nonstrict satisfaction of the conditions listed in the theorem fol- 
lows from Theorem 6.3, Proposition 6.4 and item 5 of Theorem 4.1. To 
prove that the strict forms of these conditions and the "=^" part of reach- 
ability condition are violated, it suffices to consider the digraph T with n > 3, 
E(T) = {(1,2), (2, 3)}, and w 12 = w 23 = 1. □ 

Let us mention one more class of accessibility measures, (/ + a J)" 1 , < 



a < a (d') l a (d'+i)- These measures are "intermediate" between P° nt and P out ; 
because they are positive linear combination of Jr d ,s and Jr^'+i) [3]- That is 
why we termed them the matrices of dense out-forests. In the terminology of 
[30, p. 152], (I + a J)~ x with various sufficiently small a > make up a class 
of nonnegative nonsingular commuting weak inverses for L. These measures 
and the dual measures have been studied in [3] (see also [10, p. 270-271]). 
Other interesting related topics are the forest distances [11] and the forest 
based centrality measures [15]. 

7 Rooted forests and the problem of leaders 

Ranking from tournaments or irregular pairwise contests is an old, but still 
intriguing problem. Its statistical version is ranking objects on the base of 
paired comparisons [21]. Analogous problems of the analysis of individual and 
collective preferences arise in the contexts of policy, economics, management 
science, sociology, psychology, etc. Hundreds of methods have been proposed 
for handling these problems (for a review, see, e.g., [21,22,19,4,14,18,25]). 

In this section, we consider a weighted digraph V that represents a compe- 
tition (which need not be a round robin tournament, i.e., can be "incomplete") 
with weighted pairwise results. The digraph can also represent an arbitrary 
weighted preference relation. The result we present below can be easily ex- 
tended to multidigraphs. 

One of the popular exquisite methods for assigning scores to the partic- 
ipants in a tournament was independently proposed by Daniels [20], Moon 
and Pullman [31,32], and Ushakov [38,39] and reduces to finding nonzero and 
nonnegative solutions to the system of equations 

(14) Lx = 0. 

Entry Xi of a solution vector x = (x 1 , . . . ,x n ) is considered as a sophis- 
ticated "score" attached to vertex %. This method was multiply rediscovered 
with different motivations (some references are given in [18]). As Berman 
[5] noticed (although, in other contexts, similar results had been obtained by 
Maxwell [29] and other writers, see [7]), if a digraph is strong, then the general 
solution to (14) is provided by the vectors proportional to t = (t 1 , . . . ,t n ) T , 
where tj is the weight of the set of spanning trees (out-arborescences) diverg- 
ing from j. This fact can be easily proved as follows. By the matrix-tree 
theorem for digraphs (see, e.g., [24]), t ■ is the cofactor of any entry in the jth 
column of L. Then for every i G ^(r), Y^=i ^ij h = det (^ ne row expansion 
of detL) and, since detL = 0, t is a solution to (14). As rank(L) = n — 1 



(since the cofactors of L are nonzero), any solution to (14) is proportional to t. 

Berman [5] and Berman and Liu [6] asserted that this result is sufficient to 
rank the players in an arbitrary competition, since the strong components of 
the corresponding digraph supposedly "can be ranked such that every player 
in a component of higher rank defeats every player in a component of lower 
rank. Now by ranking the players in each component we obtain a ranking of 
all the players." While the statement about the existence of a natural order 
of the strong components is correct in the case of round-robin tournaments, 
it need not be true for arbitrary digraphs that may have, for instance, several 
source knots. That is why, the solution devised for strong digraphs does not 
enable one to rank the vertices of an arbitrary digraph. 

Let us consider the problem of interpreting, in terms of forests, the general 
solution to (14) and the problem of choosing a particular solution that could 
serve as a reasonable score vector in the case of arbitrary digraph T. 

If T contains more than one source knot, there is no spanning diverging 
tree in T. Recall that K l , . . . , K d , are the source knots of T, where d' is the 
out-forest dimension of T, and K = |J S=1 K a . 

Suppose, without loss of generality, that the vertices of V are numbered as 
follows. The smallest numbers are attached to the vertices in K l , the following 
numbers to the vertices in K 2 , etc., and the largest numbers to the vertices in 
V(T) \ K. Such a numeration we call standard. 

Theorem 7.1 Any column of J is a solution to (14)- Suppose that the 
numeration of vertices is standard and j x G K 1 ,...,j d , G K d , . Then the 
columns J. , . . . , J • of V make up an orthogonal basis in the space of solu- 

tions to {14) and J. js = w~\T s )(0, . . . , 0, w{T> +1 ), w(T> +ka ), 0, . . . , 0) T , 
where {i s +1, . . . , i s +k s } = K s and % is the set of out-arborescences of K s , 
s = 1, . . . , d' . 

By virtue of Theorem 7.1, the general solution to (14) is the set of all linear 
combinations of partial solutions that correspond to each source knot of V. 

Proof. The first statement follows from LJ = (item 3 of Theorem 4.1). By 
item 6 of Theorem 4.1, rank(J) = d' and rank(L) — n — d' . Hence, d! is the 
dimension of the space of solutions to (14). Let j s G K s , s = 1, . . . , d'. Then, 
by items 1 and 2 of Theorem 4.2, 

J. js = w-\T s )(0, . . . , 0, w{7? +1 ), . . . , w{T s ° + H 0, • • • , 0) T . 

These d' solutions to (14) are orthogonal and thus, linearly independent. □ 



As a reasonable ultimate score vector, the arithmetic mean x — -J • 

____ ' n 

(1, . . . , 1) T of the columns of J can be considered. A nice interpretation of 
this vector is given by 

Corollary 7.2 (from Theorem 4.3). For any Markov chain inversely corre- 
sponding to T, x = y t J • (1, . . . , 1) T is the limiting state distribution, provided 
that the initial state distribution is uniform. 

It can be mentioned, however, that the ranking method based on J takes 
into account long paths in V only. That is why, in any solutions to (14), the 
vertices that are not in the source knots are assigned zero scores, which is 
questionable. The estimates based on the matrices Q(r), instead of J, are free 
of this feature. On the other hand, both methods violate the self- consistent 
monotonicity axiom [18], and so do the methods that count the routes between 
vertices. This axiom is satisfied by the generalized Borda method [12,13] that 
produces the score vectors J'(r)-(od(l) — id(l), . . . , od(n) — id(n)) T , where J'ij) 
is the matrix J(r) of the undirected graph corresponding to Y [37]. In our 
opinion, the latter method can be recommended as a well-grounded approach 
to scoring objects on the base of arbitrary weighted preference relations, in- 
complete tournaments, irregular pairwise contests, etc. 

A concluding remark: a communicatory interpretation 
of some forest matrices 

In closing, let us mention an interpretation of forest matrices in terms of 
information dissemination. Consider the following metaphorical model. First, 
a plan of information transmission along a digraph is chosen. Such a plan is 
a diverging forest F £ JF*^: the information is injected into the roots of F; 
then it ought to come to the other vertices along the arcs of F . Suppose that 

£]0, 1] is the probability of successful information transmission along the 
(i,j) arc, i,j £ V(T), and that the transmission processes in different arcs 
are statistically independent. Then w(F) is the probability that plan F is 
successfully realized. Suppose now that each plan is selected with the same 
probability |jF*^| _1 . Then J ^ (see (6)) is the probability that the information 
came to j from root i, provided that the transmission was successful. As a 
result, if one knows that the information was corrupted at root i and the 
transmission was successful, then is the probability that this corrupted 
information came to j. 

Similarly, interpretations of this kind can be given to other normalized 



forest matrices. This model is compatible with that of centered partitions [28] 
and comparable with some models of [33]. 
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